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54) Soluble LDL receptor and gene. 



A truncation of the human LDL receptor gene has been created which can be expressed in a 
heterologous host to produce a soluble fragment to the receptor protein. The gene can be expressed in 
insect cells in culture to produce a protein fragment which is not only water soluble but which also 
retains affinity for binding to LDL The truncation is a truncation of the carboxyl terminus of the native 
LDL receptor gene which results in a 354 amino acid protein fragment designated LDL-R- 364 . 
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Field of the Invention 

The present invention relates to the general field of molecular biology and relates, in particular, to the iso- 
ation and utilizatic n of novel gene fragments to create useful fragments of proteins for therapeutic and research 
5 purposes. 

Background of the Invention 

Cholesterol is an abundant substance in animals because it has an essential function in the membranes 

10 of animal cells. Cholesterol is almost completely insoluble in aqueous solution. Within the body other than in 
membranes, therefore, cholesterol must be carefully segregated from exposure to aqueous environments. 
Therefore, in multicellular organisms, cholesterol is transported about the body packaged within the hydro- 
phobic cores of plasma lipoproteins. The various plasma lipoproteins in humans have been classified into four 
major classes, principally by buoyant density. The four major classes are very low density lipoprotein i VLDL), 

15 intermediate density lipoprotein (IDL) low density lipoprotein (LDL), and high density lipoprotein (HDL). A fifth 
class of lipoprotein is chylomicrons, which occur only after feeding. The most abundant cholesterol-carrying 
lipoproteins in human plasma are of the LDL class, and high levels of LDL circulating in blood plasma are as- 
sociated with arteriosclerosis and coronary dysfunction. Hence, the mechanisms which purge LDL from the 
blood stream have been subject to significant study. 

20 In higher animals, LDL is scavenged from the blood stream by receptors located on the cell membranes 

of animal cells. The receptors are known as the LDL receptor. The structure of the LDL receptor has been gen- 
erally described, and the gene coding for the production of the receptor has been isolated and sequenced. 
Descriptions of the LDL receptor, its structure, and its general functionality in interaction with other cellular 
processes can be found in Schneider, Bio. EL Biophys. Acta ., 988, pages 307-317 (1989) and Hobbs, et al., 

25 Annu. Rev. Genet ., 24, pages 133-170 (1990). The LDL receptor protein is a cell surface glycoprotein that reg- 
ulates plasma cholesterol by mediating endocytosis of lipoproteins. The human LDL receptor is a protein of 
860 amino acids encoded by a gene which actuates the transcription of an mRNAof 5.3 kb in length. The rnRNA 
of the human LDL receptor gene includes a long 3' untranslated region, as well as a signal peptide which ac- 
tuates transport of the protein to the plasma membrane, and which is cleaved from the mature form of the 

3C protein. 

The LDL receptor protein has been characterized as having generally five different domains or regions. 
One region, referred to as the ligand binding domain, is generally associated with the affinity of the receptor 
for the two known proteins which are bound by the receptor, ApoB-100, the 550 kd glycoprotein that is the 
predominant protein of LDL. and ApoE, a 34 kd protein found in multiple copies in IDL and a subclass of HDL. 

35 Other domains in the mature LDL receptor protein include an EGF (epidermal growth factor) precursor hom- 
ology region which, while not specifically necessary for ligand affinity, appears to play a role in the presentation 
of the LDL receptor to lipoproteins external to the cell. The third domain of the LDL receptor is a region con- 
taining clustered O-linked sugar chains. Amembrane spanning region of 22 amino acids, of high hydrophobicity. 
is responsible for the membrane bound nature of the protein. At the carboxyl end of the protein is a domain 

40 of 50 amino acids, which extends into the cytoplasm. 

While the human LDL receptor gene and the human LDL receptor protein have been studied extensively, 
this study has been a difficult and, at times, an arduous process. One of the reasons for the difficulty of this 
study is the inability of researchers to isolate and utilize large quantities of the LDL receptor protein itself. While 
the protein is produced in significant quantities in animal cells, it is membrane bound and presumably insoluble 

45 in aqueous solution. Hence, it is quite difficult to isolate the receptor protein from mammalian tissues in any 
significant quantity. In addition, since the affinity for membranes is quite strong, it is difficult to express the 
human LDL receptor protein in heterologous hosts and isolate pure quantities of protein, due to the strong ten- 
dency of the protein to associate with membranes in the expression system. In addition, the protein has a com- 
plex three dimensional structure, particularly in the ligand binding domain, which is cysteine-rich and includes 

so multiple disulfide bonds. Since the three dimensional structure of the protein is essential for the affinity of the 
receptor to LDL, production of the protein in hosts which do not properly process proteins subsequent to trans- 
lation to result in a correct three dimensional structure would not give nse to functional receptor protein. 

Summary of the Invention 

55 

The present invention is summarized in that a truncated portion of the human LDL receptor gene has been 
identified which can be expressed in insect cells in culture to produce a truncated water-soluble form of the 
LDL receptor protein which binds to, and has specific affinity for, LDL to which the native receptor has affinity. 
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It is an object of the present invention to describe a gene fragment and the protein produced from this 
fragment, which enables the production and accumulation of significant quantities of an LDL receptor fragment 
which has affinity for and recognizes LDL, and yet which is soluble in aqueous solutions. 

It is another object of the present invention to provide a new substance of possible therapeutic value, spe- 
5 cities! !y a soluble LDL receptor protein fragment which can be produced in heterologous hosts for possible clin- 
ical, diagnostic, and research use on human systems. 

It is a further object of the present invention to elucidate and describe a mechanism by which the active 
binding portion of the LDL receptor can be created in quantities in heterologous hosts, so as to be useful for 
further research purposes in further investigating the interaction of the receptor and its target molecules. 
10 It is yet another object of the present invention to describe possible practical applications made possible 

by the expression of the water-soluble LDL receptor fragment described here. 

Other objects, advantages, and features of the present invention will become apparent from the following 
specification when taken in conjunction with the accompanying drawings. 

15 Brief Description of the Drawings 

Figure 1 is a schematic view of the current model for a human LDL receptor protein. 
Figure 2 is a model of the predicted schematic structure of the truncated LDL receptor LDL-R 364 . 
Figure 3 is a graphical representation of a competition binding assay verifying the binding of the LDL re- 
20 ceptor produced in accordance with the present invention with LDL. 

Detailed Description of the Invention 

In accordance with the present invention, it has been found that a truncated portion of the gene for human 
25 low density lipoprotein receptor (LDL-R) enables the production of a soluble form of the LDL receptor which 
retains the binding specificity of the native LDL receptor. The expression of this truncated gene in insect cells 
in culture results in the production of a truncated LDL receptor protein which appears to be conformationally 
correct, and which, although soluble in aqueous solution, exhibits proper specificity for LDL proteins so as to 
be useful for LDL binding. 

30 The mechanism of the present invention may be best understood by referring first to Figure 1 which is a 

graphical illustration of the currently best understood model of the LDL receptor, and the various domains which 
constitute that protein. The functional domains of the LDL receptor protein had been identified by virtue of hom- 
ology to other proteins, and also by virtue of the phenotypes of naturally occurring mutants of the protein found 
in the human population, which have been extensively studied by Brown and Goldstein. Five domains of the 

35 mature protein have been characterized. The first is an N-terminal cystetne-rich domain, which is associated 
with ligand binding, and referred to as the ligand binding domain in Figure 1. The next region in the protein is 
a region having homology to the epidermal growth factor (EGF) precursor. The EGF precursor homology do- 
main is required for the acid-dependent disassociation of the lipoproteins from the receptor, during receptor 
recycling, and also serves to position the ligand-binding domain so that it can bind LDL on the cell surface. 

40 The third domain of the protein is a region rich in O-linked carbohydrate. The next region of the protein is a 
membrane spanning domain, of extremely hydrophobic character, which is the portion of the protein respon- 
sible for binding to the animal cell membrane. A cytoplasmic domain at the extreme carboxyt terminus of the 
protein completes the regions of the protein. 

It is previously known that virtually all polymorphisms in the human receptor that are associated with de- 

45 fective binding of LDL reside in the extreme N-terminal cysteine-rich domain. Based on the mutagenic analysis 
of the LDL receptor as performed by Esser, et al. ( J. Biol.Chem. . 263:26, pages 1 3282-13290 (1988)) and Rus- 
sell, et al . (J. Biol.Chem ., 264:36. pages 21 682-21 688 (1 989)), it is clear that all of the seven imperfect repeats 
which characterize the ligand binding domain are required, or at least involved, in optimal binding to the LDL 
ligand. Two additional imperfect repeats, designated A and B in Figure 1, are also found in the EGF precursor 

so homology domain. It seems likely from a similar mutation analysis that repeat A may also facilitate optimal ligand 
binding. 

Based on this understanding, it was decided to attempt a truncation of the full-length LDL receptor gene, 
to isolate a protein coding sequence encoding for the amino terminal portion of the protein, which would include 
the expression of all repeats 1 through 7 of the ligand binding domain plus repeat A of the region of EGF pre- 
ss cursor homology region. If a host cell expressing this truncated gene were able to facilitate the proper folding 
of the truncated protein product, then it was possible that the truncated product would retain its ability to bind 
LDL. Furthermore, the exclusion of the membrane spanning domain could possibly result in secretion of the 
produced protein. Prior to this attempt, it was not known if a conformationally correct ligand binding domain 
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could be produced without the rest of the native protein. Formation of the correct structure of complex proteins 
is still poorly understood. 

It was pre\ iously known t hat a naturally-occurring mutation of t he LDL receptor that deletes the membrane 
spanning and cytoplasmic domai is results in the partial secretion of a truncated receptor, rather than the lo- 

5 calization of all of the truncated receptor onto the membrane. However, it is not known if any biological activity 
of this extracellular mutant receptor has been detected. 

By utilization of a restriction endonuclease cleavage site in the commonly available LDL receptor cDNA 
done, it was possible to prepare a gene fragment encoding the first 354 amino acids of the mature form of 
the human LDL-R protein. The polypeptide thus created, designated here LDL-R 35 *, could be produced in a 

10 foreign host and retain specificity for the LDL molecule. The LDL-R 354 is also soluble in aqueous solution, and 
therefore has independent utility in studies of binding to LDL. 

As described below, the truncated LDL receptor gene fragment coding for LDL-R 15 * has been introduced, 
through the use of a recombinant baculovirus, into insect cells grown in culture. It is believed that the utilization 
of eukaryotic cells in general, and insect cells in particular, is an important factor in the successful production 

15 of significant quantities of receptor binding soluble protein. Because the conformational structure of the re- 
ceptor is critical to its ability to bind to the LDL proteins, if the polypeptide could not be produced in a manner 
that yielded a properly folded product in heterologous host cells, effective binding would not be possible. It is 
likely that other eukaryotic host cell systems will result in production of the same protein fragment. However, 
since the production of proteins of correct conformational character cannot be predicted, particularly for trun- 

20 cated forms of protein, it was not known in advance if a conformational^ correct receptor, i.e. one having af- 
finity to the target molecules, could be produced in any host. Nevertheless, the structure of the protein frag- 
ment expressed here does seem correct. There is also reason to believe that this particular truncation, with 
the possible exception of an amino terminal deletion of the very first repeat sequence, is probably the most 
minimal truncation that would produce active protein. Other truncations which are less severe are likely, based 

25 on the evidence presented here, to also express conformationally correct protein fragments. Any such larger 
truncations could include the EGF precursor homology region without affecting ligand binding, but should not 
include the hydrophobic membrane spanning domain from the native gene. 

The cDNA for human LDL receptor has previously been isolated by others, sequenced, and published. A 
copy of the 5.3 kbcDNA has been deposited by others with the ATCC and hosted in E. coti at Accession Number 

30 57004. From that vector, the region encoding the critical amino terminal 354 amino acid polypeptide can readily 
be isolated, by a procedure similar to that described below. 

Applications of Truncated LDL Receptor 

35 One category of use of the truncated LDL receptor protein is for diagnosis. Presently used diagnostic as- 

says of human serum LDL levels are performed by an indirect estimate method based on a precipitating poly- 
anion used to precipitate LDL with VLDL. The amount of cholesterol in VLDL is estimated by dividing total plas- 
ma triglyceride by 5. This method only estimates LDL cholesterol and is incapable of identifying patients who 
have greater numbers of LDL particles, but only modest hypercholesterolemia, because their LDL particles are 

4C simply smaller than the norm, i.e. with less cholesterol per particle. These patients may be even more at risk 
if, as some data suggest, smaller LDL particles are actually more atherogenic than larger particles. Thus a 
method of measuring either or both of protein and lipid content of the LDL would be highly desirable. Informa- 
tion as to both lipid and protein would enable estimates to be made of the mean size of LDL particles, which 
appears to be a clinically useful indication. The truncated LDL receptor described here could be used in binding 

45 assays against blood or serum to ascertain actual protein levels and/or numbers of LDL particles. 

Another potential advantage of a method of specifically capturing lipoprotein particles containing apoB 
from the blood would be potential quantitation of Lipoprotein (a), or Lp(a). Lp(a) is a protein covalently attached 
to the major LDL protein (apoB), and the plasma level of Lp(a) has proven to have a strong correlation with 
heart disease risk. The plasma level of Lp(a) appears to be both extremely variable and completely determined 

50 by genetic factors, and the level of Lp(a) results in a protein product that can vary in size by as much as 20- 
fold. The immunoassays currency used to quantitate serum Lp(a) are not valid for absolute (i.e. size indepen- 
dent) quantitation. The development of a protocol for binding LDL through the use of the LDL receptor described 
here, may permit Lp(a) to be measured, since Lp(a) is easily released from apoB by reducing agents which 
break the linking disulfide bonds. 

55 Another application of the soluble LDL receptor would be for the isolation of plasma LDL for research or 

clinical purposes. Presently LDL is isolated by preparative ultracentrifugation, which is very costly and requires 
six days of continuous ultracentrifugation time. A commercially available method or kit for LDL isolation based 
on this truncated receptor would be popular among LDL researchers, particularly since the receptor recognizes 
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apoB only on LDL and not on VLDL. An advantageous feature of the use of the truncated LDL receptor for 
affinity selection, rather than an antibody toapoB, is both this selectivity for LDL and also the fact that although 
the LDL-LDL receptor affinity is quite high, the affinity depends on calcium and can readily be dissipated 
through the use of a chelating agent such as EDTA. Antibodies to apoB typically require much harsher elution 
conditions. 

Another more dramatic possible application for this receptor would be essentially blood filtering for LDL. 
The immobilized truncated LDL receptor produced in quantity can be immobilized in a column or cartridge 
through which a patient's blood could be passed to specifically remove LDL from the blood. The column or 
cartridge could be regenerated by eluting the LDL. This therapeutic approach seems particularly adapted to 
kidney dialysis patients, who invariably suffer from hyperlipemia and who already have their blood d*alyzed 
on a regular basis. 

It is also possible to construct alternate LDL-R 354 expression vectors which condition secretion of the ex- 
pressed protein fragment. Since the mature protein is initially produced with a signal peptide, which causes 
transportation of the immature form of the protein to the plasma membrane of the cell which produces the 
protein, there is every reason to expect that a similar signal peptide strategy can be used to cause secretion 
of the truncated form of the protein described herein. 

EXAMPLES 

Construction of Gene . 

The construction of the coding sequence for the LDL-R 35 * polypeptide began with a vector designated 
pRGLDLR-13, which consists of a 1317 base pair Xbal, Rsal fragment from the plasmid pLDLR3 which has 
been deposited by others with the ATCC as Accession No. 57004. The vector pRGLDLR-13 thus includes a 
sequence encoding the first 425 amino adds of the LDL receptor, which has been inserted by blunt end ligation 
into the EcoRV site of pBluescript M13+ (Stratagene) 3*-5' orientation. A 14 base pair oligonucleotide 
(CTAGTCTAGACTAG, New England Biolabs) was used to introduce a translational termination codon in all 
reading frames, by insertion at the unique EcoRV site in pRGLDLR-13. The EcoRV site corresponds to nu- 
cleotide 1076 of the LDL-R cDNA. The resulting plasmid was designated pDL-LDLR. The 5' and 3' termini of 
the approximately 1317-bp LDL-R fragment from pDL-LDLR was confirmed by DNA sequence analysis. The 
translation terminating codon was in the predicted reading frame. An 1100 pb BamHI fragment of pDL-LDLR 
was then ligated to copies of a plasmid pAcYMI , which had been digested with BamHI and treated with alkaline 
phosphatase. The details of pAcYMI are described in Matsuura et al. J. Gen. Virol . 68:1233-1250 (1987). The 
resulting construction was designated pAcLDL-R 35 *. This transfer vector thus consists of the 5' and 3' sequenc- 
es from the Autoqrapha californica poiyhedrin gene which are flanking a protein coding sequence of nucleo- 
tides which encodes the first 354 residues of the human LDL receptor, followed by two additional amino acids 
(leucine and valine), which were introduced at the carboxyl terminus of the peptide by oligonucleotide insertion. 

Production of Recombinant Baculovirus . 

Copies of the plasmid pAcLDL-R 354 constructed as described above were co-transfected with the wild- 
type baculovirus genome (AcNPV) into Spodoptera fruqiperda Sf-21 cells. The recombinant viruses (AcNPV- 
R 3 ^ 4 ) harboring the sequence for the truncated LDL receptor were selected, plaque purified, and titered as de- 
scribed in Summers et al., A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures, 
Tex. Agric. Exp. Stn. Bull. (1987) and Miller et al. in Genetic Engineering , Plenum, New York, pp. 277-298 
(1986). 

Recombinant virus (AcNPV-R 354 ) served as the template for a polymerase chain reaction utilizing ampli- 
fying oligonucleotides homologous to the 5' and 3* sequences of the Autoqrapha californica poiyhedrin gene. 
The resulting amplification product was of a size consistent with the predicted recombination event. 

Transfection of Cultured Insect Cells and Recovery of Protein. 

The recombinant baculoviruses were then used to infect cells of SF-21 in culture. Quantities of the recom- 
binant baculovirus were infected onto dishes of 4x10 6 SF-21 cells at a multiplicity of infection of four. Two mil- 
liliters of SF-900 (Gibco) medium were harvested from the infected cells 52 hours after infection with the re- 
combinant baculovirus in TCA precipitated media to yield 60 u gram of protein. Media samples were subjected 
to sodium dodecyl sulfate, polyacrylamide gel electrophoresis (SDS-PAGE), in the absence or presence of 
beta-mercaptoethanol, with non-reducing gels of 13.3% acrylamide resolving gel with a 6% stacker and with 
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reducing gels of 10% acrytamide resolving gel with a 4.5% stacker. Immune-blotting was performed using anti- 
human LDR-R monoclonal antibody C7, and alkaline phosphatase-conjugated goat anti-mouse IgG (Sigma). 

Bindina Experiments . 

5 

Media from transfected SF-21 cells was collected 52 hours after infection and was dialyzed against PBS 
at pH 7.4. Separately, CHO cells pine TR715-19] previously demonstrated to overexpress the human LDL re- 
ceptor were grown in Hams nutrient mixture F-12 (Gibco), 20 mM Hepes, pH 7.2, 1% (V/V) fetal calf serum, 
4% (V/V) lipoprotein deficient serum (LDS), 2 mM glutamine, and 100 units per milliliter penicillin, 100 ng per 

w ml streptomycin. Twelve hours prior to use, confluent cells were switched to a medium containing Hams F-12, 
20 mM Hepes pH 7.2, 5% LDS. 2 mM glutamine, 100 units per ml penicillin, 100 u,g per ml streptomycin. 10 
uo, per ml cholesterol and 0.1 (ig per ml 25-hydroxycholesteroi. The indicated concentration of dialyzed tissue 
culture proteins (the concentration was determined by a modified Lowry assay) were incubated with 1.0 \ig of 
[ 125 I]-LDL in 2 ml Hams F-12, 20 mM Hepes, pH 7.2 and 5 mg/ml bovine serum albumin, at ambtent tempera- 

15 tures for 90 minutes. Samples were equilibrated to 4°C prior to addition to the CHO cells. The cells were incu- 
bated for 3 hours at 4°C on a rotary shaker, prior to washing, and the release of bound [125I]-LDL by dextran 
sulfate. Radio iodination of LDL was performed as described previously, e.g. Checovich et al., Biochem ., 
27:1934 (1988). 

The results as illustrated in Fig. 3 indicate that for increasing concentrations of culture media incubated 
2C with radiolabeled LDL, in the presence of cells expressing the LDL receptor, the ability of the LDL-R* 54 to spe- 
cifically bind to LDL was demonstrated. The decrease in cell associated radiolabeled LDL in the presence of 
increasing concentrations of proteins from the medium of truncated receptor expressing cells demonstrated 
competitive LDL binding by the protein fragments in the medium. Competitions of similarity severity were not 
observed with equivalent concentrations of proteins from wild-type virus infected cells. 
25 These results of the immunoblots performed as described above indicate that the peptide fragment which 

is produced in the infected insect cells described above is secreted, since it is contained with the media ex- 
tracted from the culture and recognized by the LDL-R specific monoclonal antibody C7. Furthermore, the com- 
petitive binding assay indicates that the fragment does correctly bind to the LDL molecule, with a specificity 
that is at least competitive with the native membrane-bound LDL receptor that is produced in CHO cells. Thus, 
3C for the first time, a soluble form of the LDL receptor has been created, and is now made available for use in 
non-membrane bound research, diagnostic, and therapeutic applications. 
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(1) GENERAL. INFORMATION : 

(i) APPLICANT: 

NAME: Wisconsin Alumni Research Foundation 

STREET : P O BOX 7365 
CITY: Madison, Wisconsin 53707 
COUNTRY: USA 

(ii) TITLE OF INVENTION: Soluble LDL Receptor and Gene 

(iii) NUMBER OF SEQUENCES: 2 

20 (iv) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 EPO 

(v) CURRENT APPLICATION DATA: 
25 APPLICATION NUMBER: EP 93306068.3 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1317 base pairs 
30 ( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

( l i ) MOLECULE TYPE : cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

40 (vii) IMMEDIATE SOURCE: 

(B) CLONE: LDL Receptor 

( ix) FEATURE: 

(A) NAME/KEY: CDC 

(B) LOCATION: 16.. 1086 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTAGAGGCTG CGAGC ATG GGG CCC TGG GGC TGG AAA TTG CGC TGG ACC GTC 51 
Met Gly Pro Trp Gly Trp Lys Leu Arg Trp Thr Val 



50 



15 10 

GCC TTG CTC CTC GCC GCG GCG GGG ACT GCA GTG GGC GAC AGA TGT GAA 99 
Ala Leu Leu Leu Ala Ala Ala Gly Thr Ala Val Gly Asp Arg Cys Glu 
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15 20 25 

AGA AAC GAG TTC CAG TGC CAA GAC GGG AAA TGC ATC TCC TAC AAG TGG 147 

Ajrg Asn Glu Phe Gin Cys Gin Asp Gly Lys Cys lie Ser Tyr Lys Trp 

10 30 35 40 

GTC TGC GAT GGC AGC GOT GAG TGC CAG GAT GGC TCT GAT GAG TCC CAG 19 5 

Val Cys Asp Gly Ser Ala Glu Cys Gin Asp Gly Ser Asp Glu Ser Gin 

45 50 55 60 



15 



20 



25 



GAG ACG TCC TTG TCT GTC ACC TGC AAA TCC GGG GAC TTC AGC TGT GGG 2 43 

Glu Thr Cys Leu Ser Val Thr Cys Lys Ser Gly Asp Phe Ser Cys Gly 
65 70 75 

GGC CGT GTC AAC CGC TGC ATT CCT CAG TTC TGG AGG TGC GAT GGC CAA 291 

Gly Arg Val Asn Arg Cys lie Pro Gin Phe Trp Arg Cys Asp Gly Gin 
80 85 90 

GTG GAC TGC GAC AAC GGC TCA GAC GAG CAA GGC TGT CCC CCC AAG ACG 3 39 

Val Asp Cys Asp Asn Gly Ser Asp Glu Gin Gly Cys Pro Pro Lys Thr 
95 100 105 

TGC TCC CAG GAC GAG TTT CGC TGC CAC GAT GGG AAG TGC ATC TCT CGG 387 

Cys Ser Gin Asp Glu Phe Arg Cys His Asp Gly Lys Cys He Ser Arg 
110 115 120 

CAG TTC GTC TGT GAC TCA GAC CGG GAC TGC TTG GAC GGC TCA GAC GAG 435 

Gin Phe Val Cys Asp Ser Asp Arg Asp Cys Leu Asp Gly Ser Asp Glu 

125 130 135 140 

GCC TCC TGC CCG GTG CTC ACC TGT GGT CCC GCC AGC TTC CAG TGC AAC 48 3 

Ala Ser Cys Pro Val Leu Thr Cys Gly Pro Ala Ser Phe Gin Cys Asn 
145 150 155 

AGC TCC ACC TGC ATC CCC CAG CTG TGG GCC TGC GAC AAC GAC CCC GAC 531 

Ser Ser Thr Cys He Pro Gin Leu Trp Ala Cys Asp Asn Asp Pro Asp 
160 165 170 

TGC GAA GAT GGC TCG GAT GAG TGG CCG CAG CGC TGT AGG GGT CTT TAC 57 9 

Cys Glu Asp Gly Ser Asp Glu Trp Pro Gin Arg Cys Arg Gly Leu Tyr 
175 ISO 185 

GTG TTC CAA GGG GAC AGT AGC CCC TGC TCG GCC TTC GAG TTC CAC TGC 62 7 

^ Val Phe Gin Gly Asp Ser Ser Pro Cys Ser Ala Phe Glu Phe His Cys 
190 195 200 

CTA AGT GGC GAG TGC ATC CAC TCC AGC TGG CGC TGT GAT GGT GGC CCC 67 5 

Leu Ser Gly Glu Cys He His Ser Ser Trp Arg Cys Asp Gly Gly Pro 

205 210 215 220 

45 GAC TGC AAG GAC AAA TCT GAC GAG GAA AAC TGC GCT GTG GCC ACC TGT 72 3 

Asp Cys Lys Asp Lys Ser Asp Glu Glu Asn Cys Ala Val Ala Thr Cys 

225 230 235 

CGC CCT GAC GAA TTC CAG TGC TCT GAT GGA AAC TGC ATC CAT GGC AGC 771 

Arg Pro Asp Glu Phe Gin Cys Ser Asp Gly Asn Cys He His Gly Ser 
240 245 250 
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CGG CAG TGT GAC CGG GAA TAT GAC TGC AAG GAC ATG AGC GAT GAA GTT 819 
Arg Gin Cys Asp Arg Glu Tyr Asp Cys Lys Asp Met Ser Asp Glu val 

255 260 265 

10 GGC TGC GTT AAT GTG ACA CTC TGC GAG GGA CCC AAC AAG TTC AAG TGT 8 67 

Gly Cys Val Asn Val Thr Leu Cys Glu Gly Pro Asn Lys Phe Lys Cys 
270 275 280 

CAC AGC GGC GAA TGC ATC ACC CTG GAC AAA GTC TGC AAC ATG GCT AGA 915 
His Ser Gly Glu Cys lie Thr Leu Asp Lys Val Cys Asn Met Ala Arg 

285 290 295 300 

GAC TGC CGG GAC TGG TCA GAT GAA CCC ATC AAA GAG TGC GGG ACC AAC 963 
Asp Cys Arg Asp Trp Ser Asp Glu Pro lie Lys Glu Cys Gly Thr Asn 
305 310 315 

GAA TGC TTG GAC AAC AAC GGC GGC TGT TCC CAC GTC TGC AAT GAC CTT 1011 
20 Glu Cys Leu Asp Asn Asn Gly Gly Cys Ser His Val cys Asn Asp Leu 
320 325 330 

AAG ATC GGC TAC GAG TGC CTG TGC CCC GAC GGC TTC CAG CTG GTG GCC 1059 
Lys lie Gly Tyr Glu cys Leu Cys Pro Asp Gly Phe Gin Leu Val Ala 
335 340 345 

25 CAG CGA AGA TGC GAA GAT CTA GTC TAGACTAGAT CGATGAGTGT CAGGATCCCG 1113 
Gin Arg Arg Cys Glu Asp Leu Val 

350 355 

ACACCTGCAG CCAGCTCTGC GTGAACCTGG AGGGTGCTAC AAGTGCCAGT GTGAGGAAGG 1173 

30 CTTCCAGCTG ACCCCCACAC GAAGGCCTGC AAGGCTGTGG TCCATCGCCT ACCTCTTCTT 12 3 3 

CACCAACCGG CACGAGGTCA GGAAGATGAC GCTGG AC CGG AGCGAGTACG AATTCCTGCA 12 9 3 

GCCGGGGGTC CGCTAGTTCT AGAA 1317 

35 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : 1 inear 

46 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Pro Trp Gly Trp Lys Leu Arg Trp Thr Val Ala Leu Leu Leu 
15 10 15 



45 



50 



Ala Ala Ala Gly Thr Ala Val Gly Asp Arg Cys Glu Arg Asn Glu Phe 
20 25 30 

Gin Cys Gin Asp Gly Lys Cys lie Ser Tyr Lys Trp Val Cys Asp Gly 
35 40 45 

Ser Ala Glu Cys Gin Asp Gly Ser Asp Glu Ser Gin Glu Thr Cys Leu 



55 
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50 55 60 

ser Val Thr Cys Lys Ser Gly Asp Phe Ser Cys Gly Gly Arg Val Asn 
65 70 75 80 

Arg Cys lie Pro Gin Phe Trp Arg Cys Asp Gly Gin Val Asp Cys Asp 
85 90 95 

Asn Gly Ser Asp Glu Gin Gly Cys Pro Pro Lys Thr Cys Ser Gin Asp 
100 105 110 

Glu Phe Arg Cys His Asp Gly Lys Cys lie Ser Arg Gin Phe Val Cys 
115 120 125 

Asp Ser Asp Arg Asp Cys Leu Asp Gly Ser Asp Glu Ala Ser Cys Pro 

130 135 140 

Val Leu Thr Cys Gly Pro Ala Ser Phe Gin Cys Asn Ser Ser Thr Cys 
145 150 155 160 

lie Pro Gin Leu Trp Ala Cys Asp Asn Asp Pro Asp Cys Glu Asp Gly 

165 170 175 

Ser Asp Glu Trp Pro Gin Arg Cys Arg Gly Leu Tyr Val Phe Gin Gly 
180 185 190 

Asp Ser Ser Pro Cys Ser Ala Phe Glu Phe His Cys Leu Ser Gly Glu 
195 200 205 

Cys lie His Ser Ser Trp Arg Cys Asp Gly Gly Pro Asp Cys Lys Asp 
210 215 220 

Lys Ser Asp Glu Glu Asn Cys Ala Val Ala Thr Cys Arg Pro Asp Glu 

225 230 235 240 

Phe Gin Cys Ser Asp Gly Asn Cys lie His Gly Ser Arg Gin Cys Asp 
245 250 255 

Arg Glu Tyr Asp Cys Lys Asp Met Ser Asp Glu Val Gly Cys Val Asn 
260 265 270 

Val Thr Leu Cys Glu Gly Pro Asn Lys Phe Lys Cys His Ser Gly Glu 
275 280 285 

Cys lie Thr Leu Asp Lys Val Cys Asn Met Ala Arg Asp Cys Arg Asp 

290 295 300 

Trp Ser Asp Glu Pro lie Lys Glu Cys Gly Thr Asn Glu Cys Leu Asp 

305 310 315 320 

Asn Asn Gly Gly Cys Ser His Val Cys Asn Asp Leu Lys lie Gly Tyr 
325 330 335 

Glu Cys Leu Cys Pro Asp Gly Phe Gin Leu Val Ala Gin Arg Arg Cys 
340 345 350 

Glu Asp Leu Val 
355 
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(. INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: DNA probe 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTAGTCTAGA CTAG 



Claims 

1. A water soluble polypeptide having LDL receptor binding affinity for low density lipoprotein (LDL). 

2. A polypeptide according to claim 1 comprising a truncated LDL receptor protein. 

3. A polypeptide according to claim 2 comprising a ligand binding domain and a portion of an epidermal 
growth factor homology domain of the LDL receptor protein. 

4. A polypeptide according to claim 3 comprising a truncation after amino acid 354 of the LDL receptor pro- 
tein. 

5. A polypeptide according to claim 3 or 4 which consists essentially of a truncation of native LDL receptor 
protein. 

6. A polypeptide according to claim 4 or 5 wherein the truncated LDL receptor polypeptide consists of amino 
acid residues 1 to 354 of LDL receptor protein. 

7. A method of making a water soluble polypeptide according to any one of claims 1 to 6, in a host compnsing 
the steps of 

creating a nucleotide sequence comprising a coding sequence coding for a truncated form of a LDL 
receptor gene which does not include the membrane spanning domain, and which further compnses flank- 
ing regulatory sequences effective to express the coding sequence in cells of the host; 

transforming the nucleotide sequence into cells of the host; 

culturing the cells in a medium under conditions effective to produce the polypeptide; and recov- 
ering the polypeptide from the cultured cells. 

8. A method according to claim 7 in which the host is heterologous and eukaryotic and host cells are cultured 
under conditions effective to secrete the truncated LDL receptor polypeptide. 

9. A method according to claim 8 wherein the heterologous host is insect cells in culture. 

10. A method according to claim 10 wherein the insect cells are Sf-21 cells. 

11. A method according to any one of claims 7 to 10 wherein the nucleotide sequence is introduced into the 
host cells through the use of a baculovirus. 
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